CEVCLUS: evidential clustering with instance-level constraints for relational data
نویسندگان
چکیده
Recent advances in clustering consider incorporating background knowledge in the partitioning algorithm, using, e.g., pairwise constraints between objects. As a matter of fact, prior information, when available, often makes it possible to better retrieve meaningful clusters in data. Here, this approach is investigated in the framework of belief functions, which allows us to handle the imprecision and the uncertainty of the clustering process. In this context, the EVCLUS algorithm was proposed for partitioning objects described by a dissimilarity matrix. It is extended here so as to take pairwise constraints into account, by adding a term to its objective function. This term corresponds to a penalty term that expresses pairwise constraints in the belief function framework. Various synthetic and real datasets are considered to demonstrate the interest of the proposed method, called CEVCLUS, and two applications are presented. The performances of CEVCLUS are also compared to those of other constrained clustering algorithms. V. Antoine⋆ Université Blaise Pascal, PRES Clermont Université LIMOS, UMR CNRS 6158 BP 10125, F-630000 Clermont-Ferrand, France E-mail: [email protected] Tel.: +33-473-40-52-13 B. Quost · T. Denoeux Université de Technologie de Compiègne Laboratoire Heudiasyc, UMR CNRS 7253 Compiègne, France M.-H. Masson Université de Picardie Jules Verne, IUT de l’Oise Laboratoire Heudiasyc, UMR CNRS 7253 Compiègne, France ⋆This work has been mostly developed while the author was with Heudiasyc.
منابع مشابه
CEVCLUS: Constrained evidential clustering of proximity data
We present an improved relational clustering method integrating prior information. This new algorithm, entitled CEVCLUS, is based on two concepts: evidential clustering and constraint-based clustering. Evidential clustering uses the DempsterShafer theory to assign a mass function to each object. It provides a credal partition, which subsumes the notions of crisp, fuzzy and possibilistic partiti...
متن کاملRECM: Relational evidential c-means algorithm
A new clustering algorithm for proximity data, called RECM (Relational evidential c-means) is presented. This algorithm generates a credal partition, a new clustering structure based on the theory of belief functions, which extends the existing concepts of hard, fuzzy and possibilistic partitions. Two algorithms, EVCLUS (Evidential Clustering) and ECM (Evidential c-Means) were previously availa...
متن کاملCombining Data Clusterings with Instance Level Constraints
Recent work has focused the incorporation of a priori knowledge into the data clustering process, in the form of pairwise constraints, aiming to improve clustering quality and find appropriate clustering solutions to specific tasks or interests. In this work, we integrate must-link and cannot-link constraints into the cluster ensemble framework. Two algorithms for combining multiple data partit...
متن کاملClustering with Instance-level Constraints
Clustering algorithms conduct a search through the space of possible organizations of a data set. In this paper, we propose two types of instance-level clustering constraints – must-link and cannot-link constraints – and show how they can be incorporated into a clustering algorithm to aid that search. For three of the four data sets tested, our results indicate that the incorporation of surpris...
متن کاملMedian evidential c-means algorithm and its application to community detection
Median clustering is of great value for partitioning relational data. In this paper, a new prototype-based clustering method, called Median Evidential C-Means (MECM), which is an extension of median c-means and median fuzzy c-means on the theoretical framework of belief functions is proposed. The median variant relaxes the restriction of a metric space embedding for the objects but constrains t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Soft Comput.
دوره 18 شماره
صفحات -
تاریخ انتشار 2014